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Abstract 

This paper presents a self-contained account for coupling arguments and ap- 
plications in the context of Markov processes. We first use coupling to describe 
the transport problem, which leads to the concepts of optimal coupling and prob- 
ability distance (or transportation-cost), then introduce applications of coupling to 
the study of ergodicity, Liouville theorem, convergence rate, gradient estimate, and 
Harnack inequality for Markov processes. 
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1 What is coupling 

A coupling for two distributions (i.e. probability measures) is nothing but a joint distri- 
bution of them. More precisely: 

Definition 1.1. Let (E,^) be a measurable space, and let & J^{E), the set of all 
probability measures on (i?, ^). A probability measure vr on the product space {E x 
E,^ X ^) is called a coupling of /i and if 

n{A X E) = /i(A), Ti{E X A) = iy{A), Ae,^. 
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We shall let ^{fi, v) to stand for the set of all couplings of /i and v. Obviously, 
the product measure /i x z/ is a coupling of /i and i/, which is called the independent 
coupling. This coupling is too simple to have broad applications, but it at least indicates 
the existence of coupling. Before moving to more general applications of coupling, let us 
present a simple example to show that even this trivial coupling could have non-trivial 
applications. Throughout the paper, we shall let //(/) denote the integral of function / 
w.r.t. measure /i. 

Example 1.1 (The FKG inequality). Let fi and v he probability measures on M, then for 
any two bounded increasing functions f and g, one has 

fi{fg) + uifg)>KfHg) + i^{f)fiig). 

Proof. Since by the increasing monotone properties of / and g one has 

{f{x)-f{y))igix)-giy))>0, x, y G M, 

the desired inequality follows by taking integral w.r.t. the independent coupling fixu. □ 

In the remainder of this section, we shall first link coupling to transport problem, 
which leads to the notions of optimal coupling and probability distances, then introduce 
coupling for stochastic processes. 



1.1 Coupling and transport problem 

Let xi, X2, - ■ ■ , Xn be n places, and consider the distribution /i := {/ij : i = 1, ■ ■ ■ , n} of 
some product among these places, i.e. /ij refers to the ratio of the product at place Xi. 
We have fii > and /ij = 1; that is, /i is a probability measure on E := {1, ■ ■ ■ ,n}. 
Now, due to market demand one wishes to transport the product among these places to 
the target distribution u := {ui : 1 < i < n}, which is another probability measure on 
E. Let vr := {vTjj : 1 < i, j < n} be a transport scheme, where VTjj refers to the amount to 
be transported from place Xj to place Xj. Obviously, the scheme is exact to transport the 
product from distribution /i into distribution u if and only if tt satisfies 

n n 
Hi = ^ TTij, Uj = ^ TTij, 1 < i, j < n. 

j=l i=l 

Thus, a scheme transporting from yU to is nothing but a coupling of fi and u, and vice 
versa. 

Now, suppose pij is the cost to transport a unit product from place Xi to place Xj. 
Then it is reasonable that p gives rise to a distance on E. With the cost function p, the 
transportation cost for a scheme vr is 

n „ 
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Therefore, the minimal transportation cost between these two distributions is 

W({fi,v):= inf / pdTT, 

which is called the L^-Wasser stein distance between fi and u induced by the cost function 
P- 

In general, let {E, ^) be a measurable space and let p be a non-negative measurable 
function on E x E. For any p > 1 

(1.1) iyP(;,, i.) := I inf / p^'dvrl 

is also called the L^-Wasserstein distance (or the transportation cost) between prob- 
ability measures /i and v induced by the cost function p. In general, is not really a 
distance on ^{E), but it is a distance on ^p{E) := {/i G P : p G L^^p x /i)} provided p 
is a distance on E (see e.g. |6]). 

It is easy to see from (11. ip that any coupling provides an upper bound of the trans- 
portation cost, while the following Kontorovich dual formula enables one to find lower 
bound estimates. 

Proposition 1.2 (Kontorovich dual formula). Let = {{f,g) '■ f,g ^ ^b{E),f{x) < 
g{y) + p{x,yy, G E}, where ^b{E) is the set of all bounded measurable functions on 
E. Then 

WP{p,uY= sup {p(f)-u{g)}. 

When (i?, p) is a metric space, ^b{E) in the definition of can be replaced by a sub- 
class of bounded measurable functions determining probability measures (e.g. bounded 
Lipschitzian functions), see e.g. [20] . 

1.2 Optimal coupling and optimal map 

Definition 1.2. Let p, z/ G 0^{E) and p > on E x E be fixed. If vr G "^{p.v) 
reaches the infimum in (11.11) . then we call it an optimal coupling for the transportation 
cost. If a measurable map T : E ^ E maps p into v (i.e. v = p o T~^), such that 
7r(da;, dy) := p(dx)5a;(d?/) is an optimal coupling, where 5^ is the Dirac measure at x, then 
T is called an optimal (transport) map for the U' transportation cost. 

To fix (or estimate) the Wasserstein distance, it is crucial to construct the optimal 
coupling or optimal map. Below we introduce some results on existence and construction 
of the optimal coupling/map. 

Proposition 1.3. Let {E,p) be a Polish space. Then for any /i, z/ G ^{E) and any p > 1, 
there exists an optimal coupling. 
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The proof is fundamental. Since it is easy to see that the class "^(/x, ly) is tight, for a 
sequence of couplings {vr„}„>i such that 

n— >oo 

there is a weak convergent subsequence, whose weak limit gives an optimal coupling. 

As for the optimal map, let us simply mention a result of McCann for E = M'^, see 
[25] and references within for extensions and historical remarks. 



Theorem 1.4 ([H]). Let E = M'', p(x,?/) = \x — y\, and p = 2. Then for any two 
absolutely continuous probability measures fj,{dx) := f{x)dx and ^{dx) := g{x)dx such 
that / > 0, there exists a unique optimal map, which is given by T = W for a convex 
function V solving the equation 

f = g{VV)deWacVV 

in the distribution sense, where Vac is the gradient for the absolutely continuous part of 
a distribution. 

Finally, we introduce the Wasserstein coupling which is optimal when p is the discrete 
distance on E; that is, this coupling is optimal for the total variation distance. 

Proposition 1.5 (Wasserstein couphng). Let p{x,y) = l^^^yy. We have 

W;^{fi,iyY =l\\fi-u\lar ■■= sup \fi{A) - u{A)\, 
and the Wasserstein coupling 

{p - u)+{dx){p - iyy{dy) 



7r(dx, dy) := (/i A u){dx)5x{dy) + 



{p-uy{E) 



is optimal, where {p — z/)"*" and {p — v) are the positive and negative parts respectively in 
the Hahn decomposition of p — u, and ^/\v = p — [p — v)^ . 



1.3 Coupling for stochastic processes 

Definition 1.3. Let X := {Xt}t>o and Y := {Yt}t>o be two stochastic processes on E. 
A stochastic process (X, Y) on E x E is called a coupling of them if the distributions of 
X and Y coincide with those of X and Y respectively. 

Let us observe that a coupling of two stochastic processes corresponds to a coupling 
of their distributions, so that the notion goes back to coupling of probability measures 
introduced above. 
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Let yU and v be the distributions of X and Y respectively, which are probabihty 
measures on the path space 

W := equipped with product a-algebra ^(W) := a{w ^ Wt : t e [0, oo)). 

For any vr € ^(/i, v), {W x ^{W) x ^{W), vr) is a probabihty space under which 

(X, := (wS w^), w = w^) e X 

is a couphng for X and Y . Conversely, the distribution of a coupling for X and Y also 
provides a coupling for /i and z/. 

2 Some general results for Markov processes 

Let Pt and Pt{x^dy) be the semigroup and transition probability kernel for a strong 
Markov process on a Polish space E. If X := {Xt)t>o and F := (yt)t>o are two processes 
with the same transition probability kernel Pt{x,dy), then {X,Y) = {Xt,Yt)t>o is called 
a coupling of the strong Markov process with coupling time 

T,,, := inf{t >0:Xt = Yt}. 

The coupling is called successful if T^^y < oo a.s. For any fi G ^{E), let be the 
distribution of the Markov process with initial distribution /x, and let /iPt be the marginal 
distribution of at time t. 

Definition 2.1. If for any x,y & E, there exists a successful coupling starting from (x, y), 
then the strong Markov process is said to have successful couphng (or to have the couphng 
property) . 

Let 

^ = Pi a{u; Us : s>t) 
t>o 

be the tail cx-filed. The following result includes some equivalent assertions for the coupling 
property. 

Theorem 2.1 ([3 [HI |2l]). Each of the following is equivalent to the coupling property: 

(1) For any e ^{E), \imt^^ \\fiPt - uPt\\yar = 0. 

(2) All bounded time-space harmonic functions are constant, i.e. a bounded measurable 
function u on [0, oo) x E has to be constant if 

u{t,-) = Psu{t + s,-), s,t>0. 
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(3) The tail a-algehra of is trivial, i.e. P^{X & A) = or 1 holds for fi G J^{E) and 

sr. 

(4) For any /i, z/ G ^(^), W = W holds on ST. 

A weaker notion than the couphng property is the shift-couphng property. 

Definition 2.2. The strong Markov process is said to have the shift-couphng property, 
if for any x,y & E there is a couphng {X, Y) starting at (x, y) such that Xt^ = Yt^ holds 
for some finite stopping times Ti and T2. 



be the shift-invariant cr-field. Below are some equivalent statements for the sift-coupling 
property. 

Theorem 2.2 ([2l[71[2l]). Each of the following is equivalent to the shift- coupling property: 

(5) For any fi,iy e ^{E), \imt^^ j /J H/iP^ - i^Ps\\vards = 0. 

(6) All bounded harmonic functions are constant, i.e. a bounded measurable function f 
on E has to be constant if Ptf = f holds for all t >0. 

(7) The invariant a-algebra of the process is trivial, i.e. P^{X G A) = or 1 holds for 
/i G ^{E) and A^J. 

(8) For any /i, z/ G ^(^), = holds on J . 

According to Theorem 5], the coupling property and the shift-coupling property 
are equivalent, and thus all above statements (l)-(8) are equivalent, provided there exist 
s,t > and increasing function $ G C([0, 1]) with $(0) < 1 such that 



holds, where osc(/) := sup / — inf /. 

By the strong Markov property, for a coupling (X, F) with coupling time T, we may 
let Xt = Yt for t > T without changing the transition probability kernel; that is, letting 



(2.1) \PJ{x) - PJ{y)\ = \E{fiXt) - f{Y,))\ < osc(/)P(r,,, > t), / G ^b{E). 



This implies the following assertions, which are fundamentally crucial for applications of 
coupling in the study of Markov processes. 



Let 



:= {Ae ^{W) : w G A implies w{t + ■) ^ A, t > O} 



Ptf<HPt+sf), 0</<l 




the process (X, Y) is again a coupling. Therefore, for any x,y E E and any couphng 
{X,Y) starting at {x,y) with coupling times Tr^^y, we have 
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(i) If limy^^. P(r^,y >t) = 0,x e E, then Pt is strong Feller, i.e. Pt^b{E) C Cb{E). 

(ii) Let /i be an invariant probability measure. If the coupling time Tj. ^ is measurable 
in (x, y), then 

pPt - f^Wvar < 2 / F{T,,y > t)7r(dx, dy), 71 G ^(/i, z/) 

holds for z/ e .^(E"). 

(iii) The gradient estimate 

i^Pffw V \Ptf{y)-Ptf{x)\ ^ nT.,y > t) 

\\/Ptj[x)\ := hmsup < osc(/) limsup — - — , x e E 

y^x p{x,y) y^x p{x,y) 

holds. 

By constructing coupling such that fiT^^y > t) < Ce~^^ holds for some C, A > 0, we 
derive lower bound estimate of the spectral gap in the symmetric case (see [HI [9]). 

3 Derivative formula and Harnack inequality for dif- 
fusion semigroups 

To make our argument easy to follow, we shall only consider the Brownian with drift on 
Mf^. But the main idea works well for more general SDEs, SPDEs and Neumann semigroup 



on manifolds with (non-convex) boundary (see [SlIIIlIISlinillSlEIlEHlEniEIlEl 

and references within). 

Consider the diffusion semigroup generated by L := |A + Z ■ V on R'^ for some 
Z G Cj[(M'^,M'^). Let t;, x G M"^, t > be fixed. Consider V„Pt/(a;), the derivative of Ptf 
at point X along direction v, for / G I^biM'^)- It is well known that the diffusion process 
starting at x can be constructed by solving the Ito SDE 

dXs = dBs + Z{Xs)ds, Xo = x, 

where Bg is the d-dimensional Brownian motion. We have Ptf{x) = Kf{Xt). 

Theorem 3.1 (Derivative formula). For any f G ef^b(M'^) and x,f G M.'^, 

V,Ptf{x) = ^e|/(X,) j\{t - s)W,Z{Xs) + V, d5,)|, t > 0. 

Proof. For any e > 0, let X^ solve the equation 

dX| = dB, + Z{X,)ds - -V ds, X^ = x + ev. 
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Then X| - X, 



ejt-s) 



V. In particular, Xf = Xf. To formulate Ptf{x + ev) using X^, let 



B, = B, + [z{Xr) - Z{Xl) - -^v] 



'^v |>dr, s < t, 



which is Brownian motion under the probability measure dP^ := -RedP, where 



i?£ := exp 



z{xi) - z{x;) + -V, di?,> - - / z{xi) - z{x,) + -V 



Reformulate the equation of X^ using Bg-. 

<1XI + <1B, + Z{Xl)<ls, Xl = x + ev. 

We have 



Therefore, 



Ptf{x + ev) = ^^J{Xt) = nRefiXt)]. 



ds 



V,PJ(x) = lim 

£-S>0 



Ptf{x + ev)-Ptf{x) 



£-^0 e 



□ 



We remark that this kind of integration by parts formula is known as Bismut (or 
Bismut-Elworthy-Li) formula. But our formulation is slightly different from the Bismut- 
Elworthy-Li ones using derivative processes (see |1| IT^)- 

Next, we turn to consider the Harnack inequality of P^, which enables one to compare 
values of Ptf at different points for / > 0. To this end, one may try to ask for an inequality 
like 

Ptf{x) < Cit, X, y)Ptfiy), X, y G t > 0, 

where C : (0, oo) x M?'^ — )■ (0, oo) is independent of /. It turns out that this inequality 
is too strong to be true even for Z = (see [27] for an criterion on existence of this 
inequality) . Therefore, people wish to establish weaker versions of the Harnack inequality. 
Using maximum principle Li-Yau established their dimension-dependent Harnack 
inequality with a time-shift, while using a gradient estimate argument the author [26] 
found a dimension-free Harnack inequality with powers. Both inequalities have been 
widely applied in the study of heat kernel estimates, functional/cost inequalities and 
contractivity properties of diffusion semigroups, but the latter applies also to infinite 
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dimensional models, see [H [3l [TTl [HI [281 ISSl [3Ql [311 [31] and references within. Below, we 
shall introduce a coupling method for the dimension-free Harnack inequality. 
Let ?7 be a positive continuous function. Consider the coupling 



dXs = Z{Xs)ds + dBs, Xo = X 
X - K 



dK 



ds + dBs, Yo = y. 



The additional drift rjs ■ ^x'-y'i second equation forces Yt moves to Xt, and with a 

proper choice of function rj, the force will be strong enough to make the two process move 
together before time t. We shall solve the second equation up to the coupling time 

r := inf{s > : X, = 

and let Xs = Yg for s > t. Assume that 

(3.1) {Z{xi) — Z{x2),Xi — X2) < K\xi — X2\'^, Xi,X2G]R'^ 

holds for some constant K. Then 

d\X,-Y,\<{K\Xs-Ys\-Vs}ds, s < r. 



This implies that 



Taking 



„-/f(rAt)| v- V \ <^ 



x-y\ 



-!<! 



Vsds. 



-Ks 



\x — y\e' 
/oe-2^Ms 



s > 0, 



we see \x — y\ — f^e ^'^r]sds = 0, so that t <t. Now, let 



R = exp 



\X,c, — 



■(x,-y„d5,) 



By the Girsanov theorem, under the probability RdF, the process Yt is associated to Pt- 
Therefore, 

PJ{y) = E[Rf{Yt)] = E[Rf{Xt)] < {Ptn^))"" {^R"'^"-'^)^""'^'" ■ 
By estimating moments of -R, we prove the following result. 

Theorem 3.2 (Dimension-free Harnack inequality). // (IJ. ip holds for some constant 
G M, then 

pK\x — y\^ 



(3.2) 



[Ptf{x)Y <{PtF{y))ew 



_2(p- 1)(1 -e-^*) 
holds for p > 1, non-negative function f and x,y G MJ^, t > 0. 
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According to [30], for any p > 1 the Harnack inequality (13. 2p implies the log-Harack 
inequality 

K\x — y\ 



Pt\ogf{x) < logPJiy) + J' , x,ye / G ^b(M'^),/ > 1. 



Below, we present a simple extension of this inequality to the case with a non-constant 
diffusion coefficient. 

Theorem 3.3 ([HI EI]). let a : R'^ (g) M'^ be Lipschitzian such that a* a > XI and 

(3.3) \\a{x) - a{y)\\ls + 2(x - y, Z{x) - Z{y)) < K\x - y\\ x, y G 

hold for some constants A > and K G M. Then the semigroup Pt generated by 



L := ^ ^ {a*a)ijdidj + ^ Zidi 

i,j=l i=l 

satisfies the log-Harnack inequality 

K I I ^ 

Pt log f{x) < log Ptf{y) + ^^^^~^.Kty 2/ e / G ^.(M'^), / > 1. 

There are two different ways to prove this result using coupling, one is due to [2T] 
through an L^-gradient estimate, the other is due to [31] using coupling and Girsanov 
theorem. Let us briefly introduce the main ideas of these two arguments respectively. 

Proof of Theorem \3.3\ using gradient estimate. Consider the coupling 

dXt = Z{Xt)dt + aiXt)dBt, Xo = X, 
dYt = Z{Yt)dt + a{Yt)dBt, Yo = y. 

It follows from the Ito formula and (13. 3p that 

E\Xt-Yt\^ < e-^l^-^lV-y|- 
Combining this with the Schwartz inequality we obtain the L^-gradient estimate 

|VP,/(.)P = Urn (mW-nY.)\y ^ ^,,,^\fW-Hy.}P ^ e'«P.|V/P(.) 

y^x \ \x — y\ / y^x \Xt — Yt\^ 

for / G C^(M'^). Up to an approximation argument, this implies that for / G Cfe(M'') with 
/ > 1, and for h G C^([0, tf) such that Hq = 0,ht = 1, 

^PshgPt-sfiy + ix-y)h,) 
ds 

= {h'{s){VPJogPt.J,x- y) - AP,|VlogPt_J|2}((x - y)K + y) 
^ < {\K\ ■ \x - y|e^^/2p,|VlogP,„J| - AP,|VlogP,_J|2}((x - y)K + y) 

<^^^k-2/r, se[0,t]. 
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Taking 



1 -e" 
1 -e- 



-Ks 



-Kt ' 



S > 



and integrating both sides of ()3.4p over [0, t], we prove the desired log-Harnack inequahty. 

□ 

Proof of Theorem \3. 3\ using Girsanov theorem. Let = -^(1 — e^*^*~*^),s E [0,t]. Con- 
sider the couphng 



dXs = Z{Xs)ds + a{Xs)dBs, Xq = x 

l[o,t)(s) 



dYs = Z{Ys)ds + a{Ys)dBs 



aiYMXs)-\Xs-Ys)ds, Yo = y. 



From the assumption it is easy to see that 

Rs:=exp - r-{a{Xr)-\Xr-Yr),dBr) ^ 
Jo ^ 

is a uniformly integrable martingale with 



1 r \o{XrY^{Xr-Yr^ 



-dr 



s e [o,t\ 



(3.5) 



E[RthgRt] < 



K\x — 
2A(1 -e-^*)' 



Moreover, X^ = Yt holds {Rt dP)-a.s. Therefore, by the Girsanov theorem, (13 .Sp and the 
Young inequality, we obtain 



Pt\ogf{y) = E[i?tlog/(Fi)] = E[i?tlog/(Xi)] < logE/(Xi) + E[i?i log 



<\og Ptf{x) 



K\x — 
2A(1 -e-^*)' 



□ 



4 Coupling for jump processes and applications 

For a jump process, the path will be essentially changed if a non-trivial absolutely con- 
tinuous drift is added. This means that the coupling we constructed above for diffusions 
with an additional drift is no longer valid in the jump case. Intuitively, what we can do 
is to add a "random jump" in stead of a drift. This leads to the study of 
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4.1 Quasi-invariance of random shifts 

Let X be a jump process on W^, let be a random variable on W^, and let r be a random 
time. We aim to find conditions to ensure that the distribution of X + ,^l[,-^oo) is absolutely 
continuous with respect to that of X. 

We start from a very simple jump process, i.e. the compound Levy process. L° be 
the compound Poisson process on M'^ with Levy measure uq. Let Aq be the distribution 
of Lo, which is a probability measure on the path space 

oo 

W := { ^xa[t„oo) ■.XieR''\ {0}, < t oo as z t oo}. 

i=l 

Let Aut = Ut — Ut- foT u & W and t > 0. 

Theorem 4.1 ([32]). The distribution o/Lq + ^l[r,oo) is absolutely continuous with respect 
to Aq if and only if the joint distribution of (Lq, ^, r) has the form 

Ao{duj)5o{dz)Q{u},dt) + g{uj, z,t)Ao{duj)i^{dz)dt, 

where g is a non-negative measurable function on W x M."^ x [0,oo), and 0(0;, dt) is a 
transition measure from W to [0, oo). In this case, the distribution of Lq + ^l[r,oo) is 

(4.1) |p(e = 0)+ J2 9{^- ^^tl[t,oo),Aut,t)\Ao{du). 

We note that (14.11) is an revision of the Mecke formula on Poisson spaces. By using 
quasi-invariant random shifts given in Theorem 14. H we are able to investigate 

4.2 Coupling property for O-U processes with jump 

Let L := {Lt}t>Q be the Levy process with Levy measure u (Possibly also with Gaussian 
and drift parts). Let A be a d x d-maiYix. Let Pt and Pt{x, dy) be the transition semigroup 
and transition probability kernel for the solution to the linear SDE 

dXt = AXtdt + dLt. 

Theorem 4.2 ([32] ). Let {Ax,x) < hold for x E R'^. If v > po{z)dz such that 

PQ{z)~^dz < oo 



' {\z-zo\<e} 

holds for some zq G and some e > 0, then 



\P,{x, ■) - Pt{y, ■) Ur < ^^^^J^ x,yeR',t>0 



holds for some constant C > 0. 
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Remark, (a) The condition Po(^) ^d^ < oo is very weak, as it holds provided 

Pq has a continuous point zq G M'^ such that Po{zq) > 0. Successful couplings have also 
been constructed in [23] under a slightly different condition. 

(b) The convergence rate we derived is sharp. To see this, let ■ |^ + 1) < oo. For 
the compound Poisson process there exists c > such that 

\\Pt{x, ■) - Pt{y, OILar > ^, t > 1 + |x - y\\ 

(c) The appearance of 1 in the upper bound is essential if A := //(M'^) < oo, as in this 
case with probability e~^* the process does not jump before time t, so that 

||Pi(x, ■) - Ptiy, Ollmr > 2e-^*, t>0,xy^y. 

Similarly to what we did for the diffusion case, we can use the coupling argument to 
investigate 

4.3 Derivative formula and gradient estimate 

Let u > uq := pQ{z)dz such that 

Ao := MM.'^) < oo. 

The compound Poisson process with Levy measure z/q can be formulated as 

Nt 
i=l 

where A''^ is the Poisson process with rate Aq and {^i} are i.i.d. random variables indepen- 
dent of {Nt)t>Q with common distribution uq/Xq. Let be the Levy process independent 
of L° such that L := L° + is the Levy process with Levy measure u. Let Tj be the z-th 
jump time (or ladder time) of Nf. Let be the solution to the liner SDE with initial 
value X. Consider the gradient of 

P//(x) :=E{/(Xniw<o}. 
Theorem 4.3 ([33]). Let po E Cl(R'^) such that i^{dz) > po{z)dz and 



/ sup |Vpo|(a^)d2; < oo 

jRd. \x-z\<e 



holds for some e > 0. Then for any t > and f G 



1 

VP,' fix) = E{/(Xf)l{;v.>i}^$^e^*-»Vlogpo(e 



* ^=l 
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Next, by the above derivative formula and comparing the small jump part with subor- 
dinations of the Brownian motion, we obtain the following result on the gradient estimate 
of Pt, which is much stronger than the strong Feller property. 

Theorem 4.4 ([33]). Let A < -61 and 

u{dz) > \z\-''Si\z\-^)^,\^ro}dz 
hold for some Tq > and Bernstein function S such that S{0) = and 



POO 1 

a(t) := / — e-*"^(")dr < oo, t > 0. 
Jo Vr 



Then there exist two constants Cq, Ci > such that 

||VPi/||oo < Cie"^^*a(co(t A l))||/||oo, / e > 0. 

// in particular A = then 

||VPi/||oo < ci(a(cot) + -)||/||oo, / e > 0. 

Obviously, if limj,_j.oo = oo then a{t) < oo holds for all t > 0. Concretely, if 
u{dz) > c|2|-('^+")l{|^|<^,} (i.e. S{r) = (cr)"/^) for some c' > and a G (0,2) then 
Ci(t) < 7T/^, t > 0, and hence. 



IVPJIloo < 



(t A 1)1/° 



For detailed proofs of the above results and further developments on couplings and appli- 
cations of Levy processes, one may check with recent papers [5], [221 1231 [321 [33] . 



References 

[1] S. Aida and T. Zhang, On the small time asymptotics of diffusion processes on path 
groups, Pot. Anal. 16(2002), 67-78. 

[2] D. Aldous, H. Thorisson, Shift- coupling, Stoch. Proc. Appl. 44(1993), 1-14. 

[3] M. Arnaudon, A. Thalmaier, and F.-Y. Wang, Harnack inequality and heat kernel 
estimates on manifolds with curvature unbounded below. Bull. Sci. Math. 130(2006), 
223-233. 

[4] J. M. Bismut, Large Deviations and Malliavin Calculus, Birkhauser, Boston, 1984. 



14 



[5] B. Bottcher, R. L. Schilling, J. Wang, Constructions of coupling processes for Levy 
processes, larXiv: 1009.551 II 

[6] M.-F. Chen, From Markov Chains to N on- Equilibrium Particle Systems, World Sci- 
entific, Singapore, 1992. 

[7] M. Cranston, A. Greven, Coupling and harmonic functions in the case of continuous 
time Markov processes, Stoch. Proc. Appl. 60(1995), 261-286. 

[8] M.-F. Chen, F.-Y. Wang, Application of coupling method to the first eigenvalue on 
manifold, Sci. in China (A) 40(1997), 384-394. 

[9] M.-F. Chen, F.-Y. Wang, Estimates of spectral gap for elliptic operators. Trans. 
Amer. Math. Soc. 349(1997), 1239-1267. 

[10] M. Cranston, F.-Y. Wang, A condition for the equivalence of coupling and shift- 
coupling, Ann. Probab. 28(2000), 1666-1679. 

[11] G. Da Prato, M. Rockner, F.-Y. Wang, Singular stochastic equations on Hilhert 
spaces: Harnack inequalities for their transition semigroups, J. Funct. Anal. 257 
(2009), 992-017. 

[12] K.D. Elworthy, X.-M. Li, Formulae for the derivatives of heat semigroups, J. Funct. 
Anal. 125(1994), 252-286. 

[13] A. Es-Sarhir, M.-K. v. Renesse and M. Scheutzow, Harnack inequality for functional 
SDEs with hounded memory. Electron. Commun. Probab. 14 (2009), 560-565. 

[14] H. Kawabi, The parabolic Harnack inequality for the time dependent Ginzburg- 
Landau type SPDE and its application. Pot. Anal. 22(2005), 61-84. 

[15] P. Li, S.-T. Yau, On the parabolic kernel of the Schrddinger operator. Acta Math. 
156(1986), 153-201. 

[16] T. Lindvall, Lectures on the Coupling Methods, Wiley, New York, 1992. 

[17] W. Liu, Harnack inequality and applications for stochastic evolution equations with 
monotone drifts, J. Evol. Equ. 9 (2009), 747770. 

[18] W. Liu, F.-Y. Wang, Harnack inequality and strong Feller property for stochastic 
fast diffusion equations, J. Math. Anal. Appl. 342(2008), 651-662. 

[19] R. J. MacCann, Existence and uniqueness of monotone measure-preserving maps, 
Duke Math. J. 80(1995), 309323. 

[20] S.T. Rachev, Probability Metrics and the Stability of Stochastic Models, Wiley, 
1991. 



15 



[21] M. Rockner, F.-Y. Wang, Log-Harnack inequality for stochastic differential equations 
in Hilbert spaces and its consequences, Infin. Dimens. Anal. Quant. Probab. Relat. 
Topics 13(2010), 27-37. 

[22] R. L. Schilling, J. Wang, On the coupling property of Levy processes, to appear in 
Ann. Inst. H. Poincare: Probab. Stat., |ar^vil006.5288j 

[23] R. L. Schilling, P. Sztonyk, J. Wang Coupling Property and Gradient Estimates of 
Ley Processes via the Symbol, available online [a"rXiv:1011. 10671 

[24] H. Thorisson, Shift- coupling in continuous time, Probab. Theory Relat. Fields 
99(1994), 477-483. 

[25] C. Villani, Optimal transport: Old and New, Springer, Berlin, 2009. 

[26] F.-Y. Wang, Logarithmic Sobolev inequalities on noncompact Riemannian mani- 
folds, Probab. Theory Relat. Fields 109(1997), 417-424. 

[27] F.-Y. Wang, A Harnack-type inequality for non-symmetric Markov semigroups, J. 
Funct. Anal. 239(2006), 29-309. 

[28] F.-Y. Wang, Dimension-free Harnack inequality and its applications. Front. Math. 
China 1(2006), 53-72. 

[29] F.-Y. Wang, Harnack inequality and applications for stochastic generalized porous 
media equations, Ann. Probab. 35(2007), 1333-1350. 

[30] F.-Y. Wang, Harnack inequalities on manifolds with boundary and applications, J. 
Math. Pures Appl. 94(2010), 304-321. 

[31] F.-Y. Wang, Harnack inequality for SDE with multiplicative noise and extension to 
Neumann semigroup on non-convex manifolds, to appear in Ann. Probab., available 
online l arXiv:0911.164l 

[32] F.-Y. Wang, Coupling for Ornstein- Uhlenbeck Processes with Jumps, to appear in 
BerlouUi, available online arXiv: 1002.2890. 

[33] F.-Y. Wang, Gradient Estimate for Ornstein- Uhlenheck Jump Processes, to appear 
in Stoch. Proc. Appl., available online [a"rXiv:1005. 50231 

[34] F.-Y. Wang, L. Xu, Derivative formula and applications for hyperdissipative stochas- 
tic Navier-Stokes/Burgers equations, available online [arXiv: 1009. 14641 

[35] T.-S. Zhang, White noise driven SPDEs with reflection: strong Feller properties and 
Harnack inequalities. Potential Anal. 33 (2010), 137-151. 



16 



